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Asymptotic  Coverage  Distributions  on  the  Circle 

by 

Andrew  F.  Siegel 

1.  Introduction  and  Summaiy. 

Coverage  problems  arise  in  a wide  variety  of  applications.  The 
particular  problem  of  the  coverage  of  a circle  by  n random  equal  arcs  ^ 
each  of  length  a^^  has  been  associated  with  research  in  iramiinology 
(Moran  and  Fazekas  de  St.  Groth^  1962),  genetics  (Stevens^  1939)^ 
time  series  analysis  (Fisher^  19^0).  In  a previous  paper  (Siegel^  1978a) ^ 
exact  formulae  for  the  moments  and  distribution  of  coverage  were  obtained 
for  this  problem.  The  purpose  of  the  present  paper  is  to  explore  the 
asymptotic  behavior  of  the  coverage  di.stribution  for  large  n^  because 
the  exact  distributions  become  difficult  to  evaluate  numerically  in  this 
case.  It  is  hoped  that  these  limiting  distributions  will  prove  to  be 
useful  approximations  in  more  general  coverage  problems. 

Definitions^  notation^  and  a distributional  representation  of  the 
coverage  are  given  in  section  2.  The  asymptotic  distribution  of  the 
vacancy  is  found  in  sections  3 and  k,  under  different  behavior  of  the  arc 
length  sequence.  In  section  3,  the  arc  length  is  chosen  so  that  the 
probability  of  complete  coverage  of  the  circle  remains  fixed  as  n grows. 
The  limiting  distribution  is  found  to  be  a mixture  of  a discrete  mass 
point  at  zero  with  a positive  continuous  random  variable^  and  may  be 
interpreted  in  a natural  way  as  a noncentral  chi-square  distribution  with 
zero  degrees  of  freedom.  Section  4 treats  proportionately  smaller  arc 
lengths^  and  a limiting  normal  distribution  is  obtained. 
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2.  Definitions^  Notation^  and  a Distributional  Representation 
of  the  Coverage. 

Let  n arcs_,  each  of  length  a.,  be  placed  independently  with 
centers  uniformly  distributed  on  the  edge  of  a circle  of  circumference 
one.  Denote  these  arcs  by  The  coverage  is  defined  as 

n 

(2.1)  C(n,a)  = p(  U A.  ) 

i=l  ^ 

where  denotes  Lebesgue  measure  on  the  circle,  C(n,a)  is  the 
(random)  proportion  of  the  circle  that  is  contained  in  at  least  one 
arc.  The  vacancy  is  defined  as 

(2.2)  V(n,a)  = l-C(n^a) 

and  is  the  random  proportion  of  the  circle  that  is  contained  in  no  arc. 

It  is  introduced  because  it  is  generally  easier  to  work  with  mathematically 
than  is  the  coverage.  For  a thorough  treatment  of  the  foundations  of 
coverage  problems^  the  reader  is  referred  to  Ailam  (I966). 

The  event  V(n,a)  = 0 represents  complete  coverage  of  the  circle 
by  these  arcs.  Its  probability  will  be  denoted  by  P(n,a)^  and  was 
found  by  Stevens  (1939)  "to  be 

(2.3)  P(n,a)  = P(V(n,a)=0)  = £ (-1)^ (“) 

^=0 


where  (t)_j^  = max(t,0). 


2 


R.  A.  Fisher  (19^0)  discovered  a link  between  this  coverage  problem 
and  the  analysis  of  time  series^  from  which  we  obtain  a useful  representa- 
tion of  the  vacancy.  Let  be  independent  and  identically 

distributed  (or  alternatively,  any  exponential  distribution).  IJormalize 

them  so  that  they  sum  to  one  by  defining 

n 

(2.!>)  Y = xy  S X . 

d=i  ^ 

Then  a distributional  representation  of  the  vacancy  is  given  by 

n 

(2-5)  V(n,a)  = £ (Xi-a)+  • 

i=l 

This  follows  because  the  may  be  interpreted  as  the  spacings  between 

adjacent  counter-clockwise  endpoints  of  arcs,  as  is  illustrated  in 
figure  2,1. 
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3*  Limiting  Coverage  Distribution;  Constant  Coverage  Probability. 

In  this  section  "we  find  the  limiting  distribution  of  the  vacancy 
in  the  case  in  which  the  coverage  probability  stays  constant,  ihus 
for  an  experiment  in  which  n random  arcs  are  placed^  the  length  of 
each  arc  will  be  a and  is  chosen  so  that  P(n,a  ) = 7,  inhere  7 
is  the  fixed  coverage  probability  and  lies  strictly  between  zero  and 
one.  It  is  important  to  note  that  in  a given  experiment^  all  arcs 
placed  are  of  the  same  length.  The  treatment  of  random  arcs  of 
different  sizes  is  considerably  more  complicated  (Siegel^  1978b). 

In  theorem  3.1^  the  behavior  of  the  sequence  a^  is  characterized, 
and  in  theorem  3*2,  the  asymptotic  distribution  of  the  vacancy  is  found. 
Following  this  is  a discussion  of  the  interpretation  of  this  distribution 
as  a noncentral  chi-square  distiibution  with  zero  degrees  of  freedom. 

Theorem  3*1*  Let  P = log(l/7).  Then 

(3.1)  lim  P(n,  ilog(i-))  = 7 

n *♦  CO 


and 


(3.2) 


lim  n[a^  - ^ log(i-)]  = 0 

n oo 


where,  we  recall,  a satisfies  P(n, a ) = 7* 

1 n 

Note  that  (3*2)  is  a much  stronger  statement  than  ~ log(^), 

meaning  their  ratio  tends  to  one  in  the  limit.  A stronger  result  is 
clearly  needed  because  i log(^)  — log(^)  for  any  positive  and  Pp. 
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Proof.  Let  G = max  Y.  ^ using  the  representation  in  section  2. 

The  circle  is  covered  by  n arcs  of  length  ~ log(p)  if  and  only  if 
^n  — n Barton  and  David  (195^)  showed  that 

(3.3)  2n  exp(-nG^)  . 

Hence 

(3.4)  P(n,  i-  log(^))  = P(2n  exp(-nG^)  > 2P)  ■*  e~^  = 7 

and  (3.1)  is  established.  For  (3.2),  we  choose  and  'tiha't  satisfy 

0 < • Then  7^  = exp(-p2)  < 7 < exp(“P^)  = 7^  and  therefore, 

for  sufficiently  large  n we  have 

■('3.5)  P(n,  i log(|-))  < P(n,a^)  < P(n,  i-  log(^)) 

using  the  convergence  result  (3-1)  Just  established.  But  P(n,t)  is 
an  increasing  function  of  t,  so 

(3.6)  ilog(^)<a^<ilog(^) 

and  therefore  (for  sufficiently  large  n) 

(3.7)  ^°s(^)  < n[a^  - i log(^)]  < log(|-)  . 

By  choosing  and  close  to  P,  we  can  make  the  outer  terms  in 

(3.7)  as  near  to  zero  as  we  wish,  completing  the  proof.  || 
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where 


Theorem  3^2.  The  limiting  distribution  of  the  vacancy  V(n^a^), 
the  arc  lengths  a^  are  chosen  so  that  the  coverage  probability  remains 
fixed  at  7,  is  given  by 


(3.8) 


nV(n,a^) 


Y 


Where  Y is  the  mixture 


(3.9) 


' 0 probability  7 
probability  I-7 


and  Z is  continuous  with  density 


(3.10) 


fp(t)  = 


e^l 


Z=X 


i! 


-t 


_ 


1^(2  t/^) 


where  we  recall  P = log(l/7),  and  denotes  a modified  Bessel 

fimction  (see,  for  exajoiple,  chapter  9 of  Oliver,  196ii-).  The 
cumulative  distribution  function  of  Y is 


. 00  k o k 

(3.11)  F (t)  = P(Y  < t)  = e’^  S ^ [e-^  s 77] 

^ k=0  P.^0  • 

where  0 < t < 00  . 
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Proof.  Set 


(3.12) 


b^  = ^log(|)  . 


We  will  use  V(n,b^)  as  an  approximation  to  V(n,a^).  Moments  of 
vacancy  were  found  by  Siegel  (1978a),  and  are 


(3.13)  E[nV(n,bJ]“  = n“  (“+"-!) 


-1 


m+n-l 


+ 


It  may  be  verified  that 


(3.14) 


lim  (1  - log  |) 

n 00 


m+n-1 


= (^) 


by  taking  logs  and  doing  a Taylor  series  expansion.  Using  this,  the 
limit  of  (3.13)  may  be  calciiLated,  and  we  define 

m m .. 

(3.15)  = lim  E[nV(n,b^)]  = ml  J]  ^ ' 

n 00  ^=1 

The  moments  of  Y may  also  be  calculated: 


(3.16) 


ey“  = (1 


-r)  [° 

Jo 


t“fp(t)at  = e“^ 


I 

i=l 


JTu 


^mt0  "1 


e ^dt 
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■v^ere  we  use  the  monotone  convergence  theorem  in  order  to  exchange  sura  and 
integral.  The  integral  is  easily  done,  and  we  obtain 

Q CO  Jl, 

(3.17)  Ei  = e JTXiTyr  • 

•■6 

Now  if  we  expand  e , multiply  the  two  series,  and  gather  powers  of  P, 
we  get 

(3.18)  EY®  = ml  1 [ i (-l)^''^“2i^)(k)]  jr  ■ 

Z -1  k=l 

The  term  in  brackets  is  this,  simply  expand  the  identity 

(l4.t  )'^  (l-t  = (l-t)“^^”^'*’^^  and  equate  coefficients  of  Thus, 

comparing  with  (3-15)  we  have 

(3.19)  Ey“  = = lim  E[nV(n,b^)]“  . 

n 00 

Convergence  of  moments  implies  convergence  in  distribution 
provided 

_ 

(3.20)  lim  — < 00 

m -^co  ^ 

(see  section  8.12  of  Brieman,  I968),  To  establish  (3.20),  we  use 
Stirling*  s formula: 
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(3.21) 


ryi  n 

= “‘ jr^“- 


fTOm  Tifiiich  (3.20)  now  follows  easily.  Kiis  proves  that 


(3.22) 


nV(n,b^)  ^ Y . 


To  now  prove  (3.8)>  it  will  siiffice  to  show  that 


(3.23) 


Wn  = n[V(n,a^)*V(n,b^)] 


From  theorem  3*lj  a = b + — -where  c 0 . Using  this  and  the 

n n n n 

representation  of  section  2^  -we  have 


(3.2k)  P(|w^|  > s)  = jP(n|_I^((Yj^-\-^)_^-(Y3^-\)+)l  > e) 


<P(|cjJ^  i{Yi  > h^  - ^)  > e) 


■where  i{a}  is  1 if  A holds  and  0 otherwise.  Applying  the  Markov 
inequality  to  (3.2^4*)^  we  have 


(3o25) 


p(|wj  >,)<^[nP(Y^>h^--^)] 


It  can  be  verified  directly  that  if  0 < t < 1,  then 


(3.26) 


P(Y,  > t)  = (1-t) 
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Using  this,  we  now  show  that  the  bracketed  term  of  (3.25)  is  boxmded. 
This  term  is 


n-1 


(3.27) 


nP(Y.  > b —)  = n(l  - b + 

^1  n n ^ ^ n n ^ 


Taking  logs  and  expanding,  we  have 


(3.28) 


log(n)  + (n-1)  log(l  - + -^) 

log(n)  - (n-l){ilog(|)  - + 0(i2^)  } 


= logO)  + o(l)  . 


Using  this  in  (3.25)  and  recalling  that  0,  we  have 


(3.29) 


P(|W^!  > e)  < g 


n' 


0(1)  0 


completing  the  proof  of  (3.8).  (3.11)  follows  from  term-by-term 

integration  of  (3.10).  1| 


It  is  interesting  to  interpret  the  limiting  distribution  2Y  of 
2n  V(n,a^)  of  theorem  3.2  as  X^(f5),  a noncentral  chi-square  with 
zero  (l ) degrees  of  freedom  and  noncentrality  parameter  We  see  from 

(3.9)  and  (3.10)  that  2Y  is  a Poisson  mixture  of  the  central  chi-square 
deviates  where  K ~ Pq(P),  where  we  use  the  convention  that 

denotes  a point  mass  at  zero  (this  Dirac  delta  distribution  is,  in  fact, 
the  limit  of  X^  as  we  let  v decrease  to  zero.  ) This  representation 
easily  handles  the  dual  discrete-continuous  nature  of  2Y. 
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2 

Recall  the  representation  of  v a positive  integer,  as  a 

Poisson  mixture  of  the  central  chi-squares  ^+2K  ^ 

see,  for  example,  chapter  2.4  of  Searle  (l97l)*  Thus  we  see  by  analogy 
that  the  limiting  distribution  of  2n  V(n,a^)  naturally  extends  this 
definition  to  the  case  v = 0.  Having  zero  degrees  of  freedom  allows  a 
mixture  of  a discrete  mass  point  at  zero  (which  corresponds  to  the  event 
of  complete  coverage  of  the  circle)  with  continuous  variation  (which 
corresponds  to  partial  coverage  of  the  circle. ) 

4.  Proportionately  Smaller  Arcs. 

The  previous  section  treated  the  case  of  constant  coverage  probability, 
in  which  the  arc  size  behaved  like  ^ log(^).  Now  we  consider  proportionately 
smaller  arcs,  of  length 

(ii-.l)  = I log(|) 


where  0 < X < 1.  Because  in  this  case  the  coverage  probability  P(n,d^) 
tends  to  zero,  as  may  be  verified  from  (3«1  there  are  no  mass  points 
in  the  limiting  distribution  of  the  vacancy.  The  main  result  of  this 
section  is 
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Theorem  The  vacancy  V(n,d^)  is  asymptotically  normal  with  mean 

(p/n)^  and  variance  2P^n  That  is^ 


(^.2) 


V(n,dj^)-0/n)^ 

•j  2^^ 


^(0,1)  ♦ 


Proof.  We  will  use 


(4.3) 


* 


n X. 

V = y (^ - d 

n 

1=1 


^2n 


n^ 


+ 


as  an  approximation  to 


(4.4) 


V(n,dn)  = 


d=i 


where  are  independent  and  identically  distributed  from 

* 

the  representation  of  section  2,  Since  is  the  sample  mean  of  shifted 

and  censored  exponentiaJ-S^  it  is  easily  seen  that 


(^.5) 


E(V*)  = (P/n)^ 


and 

(4.6)  Var(V*)  = 2P^  - | (P/n)^]  . 

Thus  the  mean  and  variance  of  and  V(n^d^)  from  (4.2)  are 

asymptotically  identical.  The  proof  of  asymptotic  normality  will  follow 

* 

immediately  from  the  following  two  lemmas.  The  first  mil  show  that 
is  asymptotically  normal  with  the  right  mean  and  variance^  and  the  second 
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lemma  will  show  that  is  close  enou^  to  V(n,d^)  to  imply  (^.2). 

Lemma  4.1.  For  defined  in  (*t.3)  and 


V*-(P/n)^ 


n (0,1) . 


Proof.  It  will  suffice  to  verify  the  Lindeberg  condition  for  triangular 
arrays,  -^ich  may  be  found  in  Loeve  (i960).  This  requires  that 


(1^.8) 


g^Cs)  = nE  I 


n' 


> e)  -»  0 


hold  for  each  e > 0,  ^ere 


(4.9) 


n 


- H-h 


[|(|)  (1  - |(r)^)]' 


n 


1 

is  the  first  term,  (—  - in  the  sirni  for  V^,  normalized  to  have 

mean  zero  and  variance  l/n.  To  establish  (4.8),  we  bound  gj^(e)  using 
the  fourth  moment: 


(4.10) 


g (e)  < E 
^n^  ^ — 2 n 
e 


^4.2X 


X_  1 R ^ 

E[  (—  - d )^  - i-(~)  ] 
2n  n'+  .nn^ 


We  do  a binomial  expansion  of  the  expectation  term  and  calculate  the 
moments  from 
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(h.ii) 


X. 


E( 


a - d f = HLL  (p/nf  , m > 

^2n  n + ^ ^ ’ — 


Thus  (4.10)  becomes 


(l^.l2)  g^(e)  < 0(1)  • {SJl;- 

n 


^i-+2X  T+3^ 

n n 


= o(-f5:) 

n 


■v^ich  completes  the  proof. 


Lemma  4.2.  Using  the  notation  of  this  section^ 


* 


V(n,d  ) -V  p 


Proofs.  Using  the  representations  (4.3)  and  (4.4) 


{^.140  P(|  _E  - d^),J  > en- 


i=l  nX 


n 


1 ^ 

as  n 00  . for  each  e > 0,  where  X = — 5! 

1=1 

let  denote  the  event  in  (4.l4).  Because 


(^.15) 


n (—  - 1)  — * 

X 

n 


(0,1)  . 


given  5 > 0 we  can  find  K such  that 


(4.i6) 


P(0  < -3  - 1 < — ) > 1(1-6) 


1 . 


PP 1 

4+4x^ 

n 


y we  must  show  that 
(l+X)/2) 

X^.  For  convenience^ 
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and 


P(--<—  -1<0)>  i(l-6) 
/n  X 


n 


wherever  n is  sufficiently  large,  ihen  P(B^),  from  (4.l4),  satisfies 

(4.18)  P(B^)  < P(B^,  (1  +^)  < -|  < 1)  + P(B^,1  < -|  < (1  - |)  )+6  . 

/n  /n 

We  consider  the  two  main  terms  in  the  right  hand  side  of  (4.l8).  Bie 
first  one  is 


(4.19)  = P(  X (^  - (l  + |)  "^  < ^|  < l)  . 

i=l  nXj^  /n 

3 /2 

The  largest  any  single  difference  can  be  in  the  above  sum  is  KX^/2tl^  , 


X. 


-1  X. 


2nd 


and  this  difference  will  be  zero  whenever  X^  < 


n 


1+K/v/n 


• Thus 


(4e20) 


T. 


1 


n ^nd  ^ 1 \ /o 

< P(  y X.I{X.  > —]  > ^ . 

“ - ^ ^ l+K//n  ^ 


i=l 


The  Markov  inequality  yields 


(4.21) 


T,  < ^ E X,I(X,  > i^) 

1 - 2e  11 


^/2 


2nd 


n 


This  expectation  is  strai^tforward  to  calculate,  and  is 


{k.22) 


nd  A/ (1+K/v/n)  -(l/2+?)X 

2{i  + 2—3  (“)  < 2(l+nd^}  (p  ) 


1+K/v/n 


l6 


for  sufficiently  large  where  ^ is  any  number  strictly  between  0 

and  1/2.  Thus  the  first  term  of  (4 ,18)  satisfies 

^ X/2  -(l/2+^)\ 

(I+.23)  Tj_  < {1  + \ log(|))  (|)  = O(ii"^^log  n)  . 

The  second  main  term^  of  the  ri^t  hand  side  of  (4.l8)  may  be 

treated  similarly  to  obtain 


(4.24) 


T^  = OCn*"^^  log  n) 


as  well.  Thus 


(!)-.25)  = 6 + 0(n"^^  log  n) 

holds  for  all  5 > 0 and  ^ e (0^l/2)^  completing  the  proof.  1| 

Because  convergence  in  distribution  in  theorem  4.1  does  not,  by 
itself,  imply  proper  behavior  of  the  moments  of  V(n,d^),  this  is 
treated  in  the  following  theorem. 

Theorem  4.2.  The  asymptotic  mean  and  variance  of  V(n,d^)  are 
(4.26)  IJi^  = E V(n,d^)  (6/n)^ 


and 

(^+.27)  0-^  = Var(v(n,d^))  ~ 
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Proof.  Exact  formiilae  for  the  moments  are  available  in  Siegel  (1978a). 


The  first  moment  is 


(»t.28)  = (1-d^)"  = (1  - I log(|))“  . 

Expanding  log(iJL^)  in  a Taylor  series,  we  have 
(1^.29)  log(lJ.jj)  = logO/n)^  + 0(i-  log(|)  ) 


which  proves  (4.26).  The  exact  formula  for  the  variance  is 


(4.30) 


cr  = 
n 


2 

n+1 


(l-d  (l-2d  - (1-d 

^ n^  n+1  ^ n^  ^ n^ 


The  first  of  the  three  terms  on  the  ri^t  side  of  (4.30)  is 
~ 26^/n^*^^,  using  the  same  expansion  technique  we  just  used  for 
To  complete  the  proof,  we  will  show  that  the  remaining  two  terms  com- 
bined are  o(l/n^^^).  Write  the  second  term  from  (4.30)  as 


(4.31) 


n-1 

n+1 


2 

n+1 


(l-2d^)'^-"l 


- 2d  (l-2d  )“  + (l-2d  )“  . 


The  first  term  on  the  ri^t  is  0(l/n^'^^),  hence  o(l/n^^^),  and  may 
be  ignored.  The  second  term  is  0 (log(^)/n^^^ ),  hence  also  o(l/n^'^^). 
It  remains  only  to  consider  the  sinn  of  the  last  term  in  (4.30 ) with  the 
last  term  of  (4.31  )^  namely 


18 


(^.32) 


(i-2d  f - (1-a  . 

Factoring  (4*32)  as  a^~b^^  it  becomes 

o n~l  . ^ n-i-1 

i^-33)  - r I (l-2d  ) (l-2d  +d„) 

^ i=0  ^ ^ ^ 

The  sm  itself  is  bounded  above  by  (n-l)  = OCn^"^). 

Thus  (4,33 ) is  0(  (log(^))^/n^^^^)  = o(l/n^'^^)  as  was  to  be  shown. 
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